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' We consider online resource allocation problems where given a set of requests our goal is to select 

a subset that maximizes a value minus cost type of objective function. Requests are presented online 
in random order, and each request possesses an adversarial value and an adversarial size. The online 
algorithm must make an irrevocable accept/reject decision as soon as it sees each request. The "profit" 
■ of a set of accepted requests is its total value minus a convex cost function of its total size. This problem 

I falls within the framework of secretary problems. Unlike previous work in that area, one of the main 

^ • challenges we face is that the objective function can be positive or negative and we must guard against 

Q , accepting requests that look good early on but cause the solution to have an arbitrarily large cost as more 

requests are accepted. This requires designing new techniques. 

We study this problem under various feasibility constraints and present online algorithms with com- 
^ ■ petitive ratios only a constant factor worse than those known in the absence of costs for the same feasi- 

I bility constraints. We also consider a multi-dimensional version of the problem that generalizes multi- 

dimensional knapsack within a secretary framework. In the absence of any feasibility constraints, we 
present an 0{£) competitive algorithm where i is the number of dimensions; this matches within constant 
factors the best known ratio for multi-dimensional knapsack secretary. 
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1 Introduction 



We study online resource allocation problems under a natural protit objective: a single server accepts or 
^ ' rejects requests for service so as to maximize the total value of the accepted requests minus the cost imposed 

■ by them on the system. This model captures, for example, the optimization problem faced by a cloud 

computing service accepting jobs, a wireless access point accepting connections from mobile nodes, or an 
advertiser in a sponsored search auction deciding which keywords to bid on. In many of these settings, the 
server must make accept or reject decisions in an online fashion as soon as requests are received without 
knowledge of the quality future requests. We design online algorithms with the goal of achieving a small 
competitive ratio — ratio of the algorithm's performance to that of the best possible (offline optimal) solution. 

A classical example of online decision making is the secretary problem. Here a company is interested in 
hiring a candidate for a single position; candidates anive for interview in random order, and the company 
must accept or reject each candidate following the interview. The goal is to select the best candidate as 
often as possible. What makes the problem challenging is that each interview merely reveals the rank of the 
candidate relative to the ones seen previously, but not the ones following. Nevertheless, Dynkin ifTTI showed 
that it is possible to succeed with constant probability using the following algorithm: unconditionally reject 
the first 1/e fraction of the candidates; then hire the next candidate that is better than all of the ones seen 
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previously. Dynkin showed that as the number of candidates goes to infinity, this algorithm hires the best 
candidate with probability approaching 1/e and in fact this is the best possible. 

More general resource allocation settings may allow picking multiple candidates subject to a certain 
feasibility constraint. We call such a problem a generalized secretary problem (GSP) and use to 
denote an instance of the problem. Here T denotes a feasibility constraint that the set of accepted requests 
must satisfy (e.g. the size of the set cannot exceed a given bound), and <^> denotes an objective function 
that we wish to maximize. As in the classical setting, we assume that requests arrive in random order; the 
feasibility constraint T is known in advance but the quality of each request, in particular- its contribution 
to <l>, is only revealed when the request amves. Recent work has explored variants of the GSP where $ 
is the sum over the accepted requests of the "value" of each request. For such a sum-of-values objective, 
constant factor competitive ratios are known for various kinds of feasibility constraints including cardinality 
constraints ifTTlfTQl . knapsack constraints |l4j|, and certain matroid constraints i5|. 

In many settings, the linear sum-of-values objective does not adequately capture the tradeoffs that the 
server faces in accepting or rejecting a request, and feasibility constraints provide only a rough approxima- 
tion. Consider, for example, a wireless access point accepting connections. Each accepted request improves 
resource utilization and brings value to the access point. However as the number of accepted requests grows 
the access point performs greater multiplexing of the spectrum, and must use more and more transmitting 
power in order to maintain a reasonable connection bandwidth for each request. The power consumption 
and its associated cost are non-linear functions of the total load on the access point. This directly translates 
into a value minus cost type of objective function where the cost is an increasing function of the load or total 
size of all the requests accepted. 

Our goal then is to accept a set A out of a universe U of requests such that the "profit" it {A) = v{A) — 
C{s{A)) is maximized; here ^(^4) is the total value of all requests in A, s{A) is the total size, and C is a 
known increasing convex cost functioifl 

Note that when the cost function takes on only the values and oo it captures a knapsack constraint, 
and therefore the problem (tt, 2^) (i.e. where the feasibility constraint is trivial) is a generalization of the 
knapsack secretary problem llH. We further consider objectives that generalize the ^-dimensional knapsack 
secretary problem. Here, we are given i different (known) convex cost functions C, for 1 < i < ^, and 
each request is endowed with £ sizes, one for each dimension. The profit of a set is given by vr(^) = 
'^{^) ~ Z^i=i where Si{A) is the total size of the set in dimension i. 

We consider the profit maximization problem under various feasibility constraints. For single-dimensional 
costs, we obtain online algorithms with competitive ratios within a constant factor of those achievable for a 
sum-of-values objective with the same feasibility constraints. For ^-dimensional costs, in the absence of any 
constraints, we obtain an 0{t) competitive ratio. We remark that this is essentially the best approximation 
achievable even in the offline setting: Dean et al. [91 show an Q.{£'^~^) hardness for the simpler ^-dimensional 
knapsack problem under a standard complexity-theoretic assumption. For the multi-dimensional problem 
with general feasibility constraints, our competitive ratios are worse by a factor of 0{l^) over the corre- 
sponding versions without costs. Improving this factor is a possible avenue for future research. 

We remark that the profit function tt is a submodular function. Recently several works llT3l |6l [HI have 
looked at secretary problems with submodulai^ objective functions and developed constant competitive algo- 
rithms. However, all of these works make the crucial assumption that the objective is always nonnegative; 
it therefore does not capture vr as a special case. In particular, if <I> is a monotone increasing submodular 
function (that is, if adding more elements to the solution cannot decrease its objective value), then to obtain 

'Convexity is crucial in obtaining any non-trivial competitive ratio — if the cost function were concave, the only solutions with 
a noimegative objective function value may be to accept everything or nothing. 
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a good competitive ratio it suffices to show that the online solution captures a good fraction of the optimal 
solution. In the case of fSl and |[T6l . the objective function is not necessarily monotone. Nevertheless, 
nonnegativity implies that the universe of elements can be divided into two parts, over each of which the 
objective essentially behaves like a monotone submodular function in the sense that adding extra elements 
to a good subset of the optimal solution does not decrease its objective function value. In our setting, in 
contrast, adding elements with too large a size to the solution can cause the cost of the solution to become 
too large and therefore imply a negative profit, even if the rest of the elements are good in terms of their 
value-size tradeoff. As a consequence we can only guai^antee good profit when no "bad" elements are added 
to the solution, and must ensure that this holds with constant probability. This necessitates designing new 
techniques. 

Our techniques. In the absence of feasibility constraints (see Section O, we note that it is possible to 
classify elements as "good" or "bad" based on a threshold on their value to size ratio (a.k.a. density) such that 
any large enough subset of the good elements provides a good approximation to profit; the optimal thi^eshold 
is defined according to the offline optimal fractional solution. Our algorithm learns an estimate of this 
threshold from the first few elements (that we call the sample) and accepts all the elements in the remaining 
stream that cross the threshold. Learning the threshold from the sample is challenging. First, following the 
intuition about avoiding all bad elements, our estimate must be conservative, i.e. exceed the true threshold, 
with constant probability. Second, the optimal threshold for the sample can differ significantly from the 
optimal threshold for the entire stream and is therefore not a good candidate for our estimate. Our key 
observation is that the optimal profit over the sample is a much better behaved random variable and is, in 
particular, sufficiently concentrated; we use this observation to carefully pick an estimate for the density 
threshold. 

With general feasibility constraints, it is no longer sufficient to merely classify elements as good and 
bad: an arbitrary feasible subset of the good elements is not necessarily a good approximation. Instead, we 
decompose the profit function into two parts, each of which can be optimized by maximizing a certain sum- 
of-values function (see Section |4l). This suggests a reduction from our problem to two different instances 
of the GSP with sum-of-values objectives. The catch is that the new objectives are not necessarily non- 
negative and so previous approaches for the GSP don't work directly. We show that if the decomposition of 
the profit function is done with respect to a good density threshold and an extra filtering step is applied to 
weed out bad elements, then the two new objectives on the remaining elements are always non-negative and 
admit good solutions. At this point we can employ previous work on GSP with a sum-of-values objective 
to obtain a good approximation to one or the other component of profit. We note that while the exposition 
in Section|4]focuses on a matroid feasibility constraint, the results of that section extend to any downwards- 
closed feasibility constraint that admits good offline and online algorithms with a sum-of-values objectiv^ 

In the multi-dimensional setting (discussed in Section O, elements have different sizes along different 
dimensions. Therefore, a single density does not capture the value-size tradeoff that an element offers. 
Instead we can decompose the value of an element into I different values, one for each dimension, and 
define densities in each dimension accordingly. This decomposes the profit across dimensions as well. 
Then, at a loss of a factor of i, we can approximate the profit objective along the "best" dimension. The 
problem with this approach is that a solution that is good (or even best) in one dimension may in fact be 
terrible with respect to the overall profit, if its profit along other dimensions is negative. Surprisingly we 
show that it is possible to partition values across dimensions in such a way that there is a single ordering over 

^We obtain an 0(a''/3) competitive algorithm wliere a is the best offline approximation and /? is the best online competitive 
ratio for the sum-of-values objective. 
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elements in terms of their value-size tradeoff that is respected in each dimension; this allows us to prove that 
a solution that is good in one dimension is also good in other dimensions. We present an 0{£) competitive 
algorithm for the unconstrained setting based on this approach in Section |5] and defer a discussion of the 
constrained setting to Section [6l 

Related work. The classical secretary problem has been studied extensively; see lfT4l[T5l and ||23]| for a 
survey. Recently a number of papers have explored variants of the GSP with a sum-of-values objective. 
Hajiaghayi et al. lITTl considered the variant where up to k secretaries can be selected (a.k.a. the /c-secretary 
problem) in a game-theoretic setting and gave a strategyproof constant-competitive mechanism. Klein- 
berg ||T9| later showed an improved 1 — 0{1/Vk) competitive algorithm for the classical setting. Babaioff 
et al. PI generalized this to a setting where different candidates have different sizes and the total size of the 
selected set must be bounded by a given amount, and gave a constant factor approximation. In |i5j| Babaioff 
et al. considered another generalization of the fc-secretary problem to matroid feasibility constraints. A 
matroid is a set system over U that is downwards closed (that is, subsets of feasible sets are feasible), and 
satisfies a certain exchange property (see f2T\ for a comprehensive treatment). They presented an O(logr) 
competitive algorithm, where r is the rank of the matroid, or the size of a maximal feasible set. This was 
subsequently improved to a O ( \/log r ) -competitive algorithm by Chakraborty and Lachish Q. Several pa- 
pers have improved upon the competitive ratio for special classes of matroids |[Tl[T0l|20l. Bateni et al. 161 
and Gupta et al. |[T6l were the first to (independently) consider non-linear objectives in this context. They 
gave online algorithms for non-monotone nonnegative submodular objective functions with competitive ra- 
tios within constant factors of the ratios known for the sum-of-values objective under the same feasibility 
constraint. Other versions of the problem that have been studied recently include: settings where elements 
are drawn from known or unknown distributions but arrive in an adversarial order |[8l[T8l|22l, versions where 
values are permuted randomly across elements of a non-symmetric set system |[24l . and settings where the 
algorithm is allowed to reverse some of its decisions at a cost IHO. 

2 Notation and Preliminaries 

We consider instances of the generalized secretary problem represented by the pair (vr, J^), and an implicit 
number n of requests or elements that arrive in an online fashion. U denotes the universe of elements. 

C 2^ is a known downwards-closed feasibility constraint. Our goal is to accept a subset of elements 
A C U with A G T such that the objective function vr(^) is maximized. For a given set T C C/, we use 
0*{T) = argmaXy^gjrn2T T^iA) to denote the optimal solution over T; O* is used as shorthand for 0*{U). 
We now describe the function vr. 

In the single-dimensional cost setting, each element e G ?7 is endowed with a value v{e) and a size 
s{e). Values and sizes are integral and ai^e a priori unknown. The size and value functions extend to sets 
of elements as s{A) = ^^eeA^i^) ^(^) = l^eGA^(^)- Then the "profit" of a subset is given by 
7r(^) = v{A) — C{s{A)) where C is a non-decreasing convex function on size: C : Z+ — >• Z"*". The 
following quantities will be useful in our analysis: 

• The density of an element, p(e) := v{e)/s{e). We assume without loss of generality that densities of 
elements are unique and denote the unique element with density 7 by e^. 

• The marginal cost function, c(s) := C(s) — C(s — 1). Note that this is an increasing function. 
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• The inverse marginal cost function, s(p) which is defined to be the maximum size for which an 
element of density p will have a non-negative profit increment, that is, the maximum s for which 

p>c{s). 

• The density prefix for a given density 7 and a set T, := {e € T : p{e) > 7}, and the partial 
density prefix, := We use P^ and Py as shorthand for P^ and respectively. 

We will sometimes find it useful to discuss fractional relaxations of the offline problem of maximizing 
vr subject to T. To this end, we extend the definition of subsets of U to allow for fractional membership. 
We use ae to denote an a-fraction of element e; this has value v{ae) = av{e) and size s{ae) = as{e). We 
say that a fractional subset A is feasible if its support supp(j4) is feasible. Note that when the feasibility 
constraint can be expressed as a set of linear constraints, this relaxation is more restrictive than the natural 
linear relaxation. 

Note that since costs are a convex non-decreasing function of size, it may at times be more profitable 
to accept a fraction of an element rather than the whole. That is, argmax^ 7r(ae) may be strictly less than 
1. For such elements, p{e) < c(s(e)). We use F to denote the set of all such elements: ¥ = {e £ U : 
argmax^ 7r(ae) < 1}, and I = [/ \ F to denote the remaining elements. Our solutions will generally 
approximate the optimal profit from F by running Dynkin's algorithm for the classical secretary problem; 
most of our analysis will focus on I. Let F* (T) denote the optimal (feasible) fractional subset of T n I for 
a given set T. Then 7r(F*(r)) > tt{0*{T n I)). We use F* as shorthand for F*{U), and let s* be the size 
of this solution. 

In the multi-dimensional setting each element has an ^-dimensional size s(e) = (si(e), . . . , se{e)). The 
cost function is composed of i different non-decreasing convex functions, Cj : Z+ — )■ Z"*". The cost of a set 
of elements is defined to be C{A) = Ci{si{A)) and the profit of A, as before, is its value minus its cost: 

7t{A) = v{A) - C{A). 

2.1 Balanced Sampling 

Our algorithms learn the distribution of element values and sizes by observing the first few elements. Be- 
cause of the random order of arrival, these elements form a random subset of the universe U. The following 
concentration result is useful in formalizing the representativeness of the sample. 

Lemma 2.1. Given constant c > 3 and a set of elements I with associated non-negative weights, Wifor 
i £ I, say we construct a random subset J by including each element of I uniformly at random with 
probability 1/2. If for all k £ I, wj. < ^ Tliei'^i ^^^^^ the following inequality holds with probability at 
least 0.7G: 

> l3{c)^Wi, 

where /3(c) is a non-decreasing function of c (and furthermore is independent of I). 

We begin the proof of Lemma IZT] with a restatement of Lemma 1 from |[T2l since it plays a crucial role 
in our argument. Note that we choose a different parameterization than they do, since in our setting the 
balance between approximation ratio and probability of success is different. 

Lemma 2.2. Let Xi, for i > 1, be indicator random variables for a sequence of independent, fair coin flips. 
Then, for Si = 'Y^\=iX}., we have Pr[Vi, Si > > 0.76. 
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We now proceed to prove Lemma l2?n While we do not give a closed form for the approximation factor 
/3(c) in the statement of the lemma, we define it implicitly as 



f3{c) = max ( ) ( 1 



0<y<i \2 + yJ \ c(l-y)y ' 

and give explicit values in Tabled] for particular values of c that we invoke the lemma with. 

Proof of Lemma \2J\ Our general approach will be to separate our set of weights / into a "good" set G and 
a "bad" set B. At a high level, Lemma l2!2l guarantees us that at worst, we will accept a weight every third 
time we flip a coin. So the good case is when weights do not decrease too quickly; this intuition guides our 
definitions of G and B. 

Let y G (0, 1) be a lower bound on "acceptable" rates of decrease; we tune the exact value later. 
Throughout, we use w{S), w{S), and w{S) to denote the total, minimum, and maximum weights of a 
set S. We form G and B as follows. 

Initialize B = $ and i = 1. Consider / in order of decreasing weight. While / / 0, we repeat the 
following. Let P be the largest prefix such that w{P) > y • w{P). If \P\ < 2, move P from / to B, i.e. 
set B := B U P and I := I \ P. Otherwise, |P| > 3 and we define Gj to be the 3 largest elements of P; 
remove them from I (i.e. set I := I \ Gi); and increment i by 1. Once we are done, we define G = UjGj. 

First, we show that the total weight in B cannot be too large. Note that we add at most 2 elements at 
a time to B; and when we do add elements, we know that all remaining elements in / (and hence all later 
additions to B) we smaller by more than a factor of y. Thus, we can see that 

2w{B) 2w(I) 

by our assumption that no individual weight is more than w{I)/c. 

Next, we show that with probability at least 0.76, we can lower bound the fraction of weight we keep 
from G. Consider applying Lemma |2.2[ flipping coins first for the weights in G in order by decreasing 
weight. Note that by the time we finish flipping coins for Gi, we must have added at least i weights to J; 
hence the weight we add to J must have value at least w_{Gi). On the other hand, we know that 

w{G^) < 2w{Gi) + w{Gi) < + l) i^{Gi), 

and so summing over i we can see that elements we accept have total weight w{J) > {i^)w{G). 

Combining our bounds for the weights of G and B, we can see that with probability 0.76 the elements 
we accept have weight 

»W > (^) -(G) = (^) (»(/) - MB)) > (^) (l - ^) ..(/); 

optimizing the above with respect to y for a fixed c gives the claimed result. Note that for each fixed 
y G (0, 1) our approximation factor is increasing in c, and so the optimal value /3(c) must be increasing in c 
as well. □ 
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Figure 1: Some specific values of approximation ratio achieved by Lemma |2?T] 

3 Unconstrained Profit Maximization 

We begin by deveioping an algorithm for the unconstrained version of the generalized secretary problem 
with T = 2^ , which already exhibits some of the challenges of the general setting. Note that this setting 
captures as a special case the knapsack secretary problem of lH where the goal is to maximize the total value 
of a subset of size at most a given bound. In fact in the offline setting, the generalized secretary problem 
is very similar to the knapsack problem. If all elements have the same (unit) size, then the optimal offline 
algorithm orders elements in decreasing order of value and picks the largest prefix in which each element 
contributes a positive mai^ginal profit. When element sizes are different, a similar- approach works: we order 
elements by density this time, and note that either a prefix of this ordering or a single element is a good 
approximation (much like the greedy 2-approximation for knapsack). 



Algorithm 1 Offline algorithm for single-dimensional (vr, 2 ) 

1: Initialize set I {a G f/ | p{a) > c (s(a))} 

2: Initialize selected set ^ 

3: Sort I in decreasing order of density. 

4: for i G I do 

5: if marginal profit for the ith element, irj^{i) > then 
6: A^AU{i} 

7: else 
8: Exit loop. 
9: end if 

10: end for 

11: Set m := argmaxj^g^j 7r(a) {the most profitable element} 

12: if 7r(m) > 7r{A) then 
13: Output set {m} 

14: else 

15: Output set A 

16: end if 



Precisely, we show that |0* n F | < 1, and we can therefore focus on approximating vr over the set I. 
Furthermore, let A{U) denote the greedy subset obtained by Algorithm [TJ which considers elements in I 
in decreasing order of density and picks the largest prefix where every element has nonnegative marginal 
profit. The following lemma implies that one of A{U) or the single best element is a 3-approximation to 
O*. 

Lemma 3.1. We have that 7r(0*) < vr(F*) + maxegiy 7r(e) < ■k{A{U)) + 2maxeg;7 7r(e). Therefore the 
greedy offline algorithm (Algorithm\T^ achieves a 3-approximation for (vr, 2^). 
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Proof. We first show that O* has at most one element from F. For contradiction, assume that O* contains 
at least two elements /i, /2 G F. Since densities are unique, without loss of generality, we assume p(/i) > 

P{f2). 

Recall that F is precisely the set of elements for which it is optimal to accept a strictly fractional amount. 
Let ai, 02 < 1 be the optimal fractions for /i and /2, i.e. argmaxQ TT{afi) = ai and argmax^, 7^(0/2) = 
02- Then adding a fractional amount of any element with density at most p(/i) to {ai/i} results in strictly 
decreased profit. But this implies vr(0*) < 7r(0* \ {/2}), contradicting the optimality of O*. 

Let / be the unique element in O* D F. By subadditivity, we get that 7r(0* \ {/}) + 7r(/) > 7r(0*). 
Since O* \ {/} C I, we have 7t{F*) > 7r(0* \ {/}). In the rest of the proof we focus on approximating 
7r(F*). 

Note that F* is a fractional density prefix of I. So let F* = Pp- U {ae}, for some e and a < 1. The 
subadditivity of vr implies vr(F*) < vr(Pp-) + 7r({ae}). Note that Algorithm [T] selects A = Pp-, and that 
7r({ae}) < 7r(e) since e G I. 

Hence, combining the above inequalities we get 'k{A) + 7r(e) + vr(/) > 7r(0*). This in turn proves the 
required claim. □ 

The offline greedy algorithm suggests an online solution as well. In the case where a single element 
gives a good approximation, we can use the classical secretary algorithm to get a good competitive ratio. 
In the other case, in order to get good competitive ratio, we merely need to estimate the smallest density, 
say p , in the prefix of elements that the offline greedy algorithm picks, and then accept every element that 
exceeds this threshold. 

We pick an estimate for by observing the first few elements of the stream U . Note that it is important 
for our estimate of to be no smaller than p . In particular, if there are many elements with density just 
below p , and our algorithm uses a density threshold less than p , then the algorithm may be fooled into 
mostly picking elements with density below p" (since elements anive in random order), while the optimal 
solution picks elements with densities far exceeding p". We now describe how to pick an overestimate of 
p" which is not too conservative, that is, such that there is still sufficient profit in elements whose densities 
exceed the estimate. 

In the remainder of this section, we assume that every element has profit at most j;^^7r(0*) for an 
appropriate constant ki , to be defined later. (If this does not hold, the classical secretary algorithm obtains 

an expected profit of at least ^p;^^7r(0*)). Then Lemma [3H implies 7r(F*) > {l - jkh^ ^(O*), 

maxeec/ vr(e) < ^7r(F*), and 7r(^([/)) > (l - 17) <F*). 

We divide the stream U into two parts X and Y , where X is a random subset of U. Our algorithm 
unconditionally rejects elements in X and extracts a density threshold r from this set. Over the remaining 
stream Y, it accepts an element if and only if its density is at least r and if it brings in strictly positive 
marginal profit. Under the assumption of small element profits we can apply Lemma 12.11 to show that 
T^{Xr\A{U)) is concentrated and is a large enough fraction of vr(0*). This implies that with high probability 
T^{X n A{U)) (which is a prefix of A{X)) is a significant fraction of t^{A{X)). Therefore we attempt to 
identify X n A{U) by looking at profits of prefixes of X. 

We will need the following lemma about A{). 

Lemma 3.2. For any set S, consider subsets Ai,A2 ^ A{S). If Ai 5 A2, then tt{Ai) > Tr{A2). In other 
words, TT is monotone-increasing when restricted to A{S) for all S C U. 

Proof. We observe that the fractional greedy algorithm sorts its input S by decreasing order of density, and 
A{S) consists of the top |.4(5)| elements under that ordering. Since F*{S) contains each element in .4(5') 
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Algorithm 2 Online algorithm for single-dimensional (vr, 2^) 
1: With probability 1/2 run the classic secretary algorithm to pick the single most profitable element else 

execute the following steps. 
2: Draw k from Binomial(n, 1/2). 

3: Select the first k elements to be in the sample X. Unconditionally reject these elements. 
4: Let T be largest density such that 7r(P^) > /3 ^1 — 7r(F*(X)) for constants /3 and fci to be specified 
later. 

5: Initialize selected set 0-^0. 
6: (orieY = U\Xdo 

7: if 7r(0 U {i}) - 7r(0) > and p{i) > r and i ^ F then 
8: O^OU{i} 
9: else 

10: Exit loop. 
11: end if 
12: end for 



in its entirety, we can see that F*{B) = B for any subset B of A{S). So for A2 C Ai C A{S), we have that 
F*{A2) = A2 C ^1 = by the optimality of F* , this implies that ■k{A2) < it{Ai) as claimed. □ 

We define two good events. Ei asserts that X n A{U) has high enough profit. Our final output is the 
set . E2 asserts that the profit of P^ is a large enough fraction of the profit of Pr- Recall that A{U) h a 
density prefix, say Pp-, and so X n A{U) = P^. We define the event £'1 as follows. 

El : 7t{P^-) > P 7T{Pp-) 

where /? is a constant to be specified later. Conditioned on Ei, we have ■k{P^) > /? (1 — l/Zci) 7r(F*) > 
/3 (1 — 1/ki) ■k{F*{X)). Note that threshold r, as selected by Algorithm|2l is the largest density such that 
■k{P^) > /3 (1 — l/fei) 7r(F*(X)). Therefore, Ei implies r > and we have the following lemma. 

Lemma 3.3. Conditioned on Ei, O = n y C A{U). 

On the other hand, P^ C P^ c A{U) along with Lemma [X2] implies 

vr(PO > <Pr) > /3 (1 - lAi) <F*{X)) > /3 (1 - l/ki) 7t{P^-) > /S^ (1 - l/k^f 7r{F*) 

where the second inequality is by the definition of r, the third by optimality and the last is obtained by 
applying Ei and A{U) > (1 - l/ki) F*. 

We define p'^ to be the lai^gest density such that 7r(Pp+) > ^1 — 7r{F*). Then /?+ > r, which 
implies Pp+ C P^ and the following lemma. 

Lemma 3.4. Event Ei implies O D Pp+. 

Based on the above lemma, we define event E2 for an appropriate constant /3' as follows 

E2:7TiPj^)>^'7r{Pp^). 
Conditioned on events Ei and E2, and using Lemma again, we get 

vr(O) > 7r(Pj+) > /3'/32(l - l/ki)Mn- 
To wrap up the analysis, we show that Ei and E2 are high probability events. 
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Lemma 3.5. If no element ofU has profit more than -^t:{0*), then Pr[Ei A ^2] > 0.52, where /3 = 0.262 
and 13' = 0.094. 

Proof. We show that Pr[£^i], Pr[£'2] > 0.76; the desired inequahty Pr[£'i A E2] > 0.52 then follows by 
the union bound. 

In the following, we assume the elements are sorted in decreasing order of density. Denote the profit 
increment of the element by = 7r(P(j)) — 7r(P(-j_i)); this extends naturally to sets A C A{U) by 
setting tt{A) = J^ieA^i"^)- subadditivity of vr, we have vr(^) > tt{A) for all A C A{U), with 

equality at A{U). 

We apply Lemma lZTl with Pp- as the fixed set / and = U nPp- as the the random set J. The weights 
in Lemma [ZT] correspond to profit increments of the elements. Note that Pp- = A{U); so we know both 
that elements in Pp- have non-negative profit increments, and 7r(Pp-) > vr(0*) — 2 maxe^t/ 7r(e). Hence 
if no element has profit exceeding a 1/113-fraction of tt{0*), we get that any element G Pp- has profit 
increment 7f(i) < 7r(ej) < 1/(113 — 2)7r(Pp-) = l/lll7r(Pp-). Hence we can apply Lemma |2?T] to get 
T^{P^) ^ 0.2627r(Pp-) with probability at least 0.76, and so the event Ei holds with probability at least 
0.76. 

By our definition of p+, the profit of Pp+ is at least /3^(1 - l//ci)^(l - l/(/ci + l))7r(0*); substituting in 
the specified values of p and ki give us that no element in Pp+ has profit increment more than 2/15 7f(Pp+). 
Thus, applying Lemma Oimphes Ft[E2] > 0.76 with /?' = 0.094. □ 

Putting everything together we get the following theorem. 

Theorem 3.6. Algorithm^achieves a competitive ratio of 616 for (vr, 2^) using ki = 112 and /3 = 0.262. 

Proof. If there exists an element with profit at least -^j^Tr{0*{U)), the classical secretary algorithm (Step 1) 
gives a competitive ratio of > Otherwise, using Lemma [331 with /?' = 0.094, we have E[7r(0)] > 
E[7r(0) I ^iA^2] Pr[^iAP2] > 0.52/3'/32(l-l/A;i)2^(F*) > 0.52/3'/32(l-l/A;i)2 (1 - l/{ki + 1)) 7r(0*) > 
3gy7r(0*). Since we flip a fair coin to decide whether to output the result of running the classical secretary 
algorithm, or output the set O, we achieve a 2 max{308, 307} = 616-approximation to vr(0*) in expectation 
(over the coin flip). □ 

4 Matroid- Constrained Profit Maximization 

We now extend the algorithm of Section[3]to the setting (vr, T) where is a matroid constraint. In particular, 
T is the set of all independent sets of a matroid over U. We skip a precise definition of matroids and will 
only use the following facts: is a downward closed feasibility constraint and there exist an exact offline 
and an 0{^/logr) online algorithm for J^), where $ is a sum-of-values objective and r is the rank of the 
matroid. 

In the unconstrained setting, we showed that there always exists either a density prefix or a single element 
with near-optimal profit. So in the online setting it sufficed to determine the density threshold for a good 
prefix. In constrained settings this is no longer true, and we need to develop new techniques. Our approach 
is to develop a general reduction from the vr objective to two different sum-of-values type objectives over 
the same feasibility constraint. This allows us to employ previous work on the setting; we lose only 

a constant factor in the competitive ratio. We will first describe the reduction in the offline setting and then 
extend it to the online algorithm using techniques from Section [3l 
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Decomposition of vr. For a given density 7, we define tlie shifted density function h^{) over sets as 
h.y{A) := X^eGA ip{^) ~ 7) ^(^) ^'^^ \he fixed density function g-y{) over sizes as g^y^s) := 7s — C(s). 
For a set ^ we use g^iA) to denote g^y{s{A)). It is immediate tiiat for any density 7 we can split the profit 
function as tt{A) = h^{A) + g^{A). In particular 7r(0*) = h^{0*) + g^{0*). Our goal will be to optimize 
the two parts separately and then return the better of them. 

Note that the function is a sum of values function where the value of an element is defined to be 
(p(e) — 7)s(e). Its maximizer is a subset of P^, the set of elements with nonnegative shifted density 
p(e) — 7. In order to ensure that the maximizer, say A, of /i^ also obtains good profit, we must ensure that 
g-y{A) is nonnegative, and therefore i^{A) > h^{A). This is guaranteed for a set A as long as s{A) < 5(7). 

Likewise, the function g^ increases as a function of size s as long as s is at most 5(7), and decreases 
thereafter. Therefore, in order to maximize g^, we merely need to find the largest (in terms of size) feasible 
subset of size no more than §(7). As before, if we can ensure that for such a subset hj is nonnegative (e.g. 
if the set is a subset of P^), then the profit of the set is no smaller than its g^ value. This motivates the 
following definition of "bounded" subsets: 

Definition 4.1. Given a density 7 a subset A U is said to be ^-bounded if A C and s{A) < 1(7). 

We begin by formally proving that the function g^ increases as a function of size s as long as s is at most 
s(7)- 

Lemma 4.1. If density 7 and sizes s and t satisfy s < t < 5(7), then g-yis) < g-f{t). 

Proof. Since C(-) is convex, we have that its mai^ginal, c(), is monotonically non-decreasing. Thus we get 
the following chain of inequalities, 

t 

Q{t) - C(s) = ^ c(z) <{t-s)x c{t) < (t - 5)7. 

z=s+l 

The last inequality follows from the assumption that t is no more than 5(7) and hence c{t) < 7. By definition 
of gf^O we get the desired claim. □ 

Proposition 4.2. For any ^-bounded set A, 7r(A) > h^f{A) and 7r(A) > ^^(A). 

Proof. Since 'k{A) = h^{A) + 57(A), it is sufficient to prove that h^{A) and 57(A) are both non-negative. 
The former is clearly non-negative since all elements of A have density at least 7. Lemma |4~T] implies that 
the latter is non-negative by taking t = s{A) and s = 0. □ 

For a density 7 and set T we define and Gl^ as follows. (We write ^^7 for HI^ and G7 for G^.) 

= argmax h^{H) = argmax s{G) 

Following our observations above, both and Gy can be determined efficiently (in the offline setting) 
using standard matroid maximization. However, we must ensure that the two sets are 7-bounded. Further, 
in order to compare the performance of G7 against O*, we must ensure that its size is at least a constant 
fraction of the size of O*. We now show that there exists a density 7 for which Hj and Gj satisfy these 
properties. 

Once again, we focus on the case where no single element has high enough profit by itself. Recall that 
F* denotes F*{T), s* denotes the size of this set and P7 denotes P7 \ {67}. Before we proceed we need the 
following fact about the fractional optimal subset F*. 
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Lemma 4.3. If F* has an element of density 7 then s* is at most 5(7). 

Proof. The proof is by contradiction. Say s* is more than 5(7). Recall that denotes the element with 
density 7. We show that in such conditions reducing the fractional contribution of e^, say by e, increases 
profit giving us a better fractional solution. This is turn contradicts the optimality of F* . 
Write s = s(e^) and note that 

7r(F*) = v{F*) - C{s*) 

= [{v{F*) - 7 X es) - C(s* - es)] - [C(s*) - C(s* - es) - 7 x es] . 

If e is such that s* — e > 5(7), then we have C(s*) — C(s* — es) > 7 x es; thus we get that the term 
[C(s*) — C(s* — es) — 7 X es] is positive which proves the claim. □ 

Definition 4.2. Let be the largest density such that Pp- has a feasible set of size at least s*. 

We now state a useful property of p . 

Lemma 4.4. Any feasible set in Pp- is -bounded and has size less than s*. Moreover for any density 
7 > all feasible subsets of P^ are ^-bounded. 

Proof. By definition, the size of any feasible set contained in Pp- \ {cp-} is no more than s*. 

We will show that s* < s(p"). Then the first part of the lemma follows immediately. For the second 
part we have 7 > and hence s(p") < 1(7). Overall a size bound of s* also implies that feasible sets in 
P.y satisfy the size requirement for being 7-bounded. Hence we get that all feasible sets in Pp- \ {cp-} are 
p -bounded and all feasible sets in P-y are 7-bounded. 

The size of F* is at most the size of its support. Thus the support of F* is a feasible set of size at least 
s*. By definition, is the largest density such that Pp- contains a feasible set of size s*. Hence we get that 
F* contains an element of density less than or equal to p . Applying Lemma 1431 we get s* < s(p") and the 
lemma follows. □ 

The following is our main claim of this section. 

Lemma 4.5. For any density 7 > p", 7r(0*(-By)) < tt{H^) + 7r{G^). Furthermore, vr(0*) < vr(ifp-) + 
7r(Gp-) + 2 ma-Keeu 7r(e). 

Proof. Let P, H, and G denote Pp-, Hp-, and Gp- respectively. As in the unconstrained setting, there can be 
at most one element in the intersection of O* and F (see proof of Lemma ITTI ). Note that 7r() is subadditive, 
hence it{0* n I) + maxggc/ 7r(e) > vr(0*). In the analysis below we do not consider elements present in F 
and show that 7r{H) + 7r{G) + maxegj/ 7r(e) > 7r(0* n I). This in turn establishes the second part of the 
Lemma. 

For ease of notation we denote Cp- as e'. Without loss of generality, we can assume that H does not 
contain e' since p(e') — p" = 0. Therefore set H is contained in P \ {e'}. By Lemma l4!4l we get that H is 
p -bounded. 

Note that by definition G does not contain e', hence its size is at most s*. Also, Gu{e'} is no smaller than 
the maximum-size feasible subset contained in P. So, by definition of p", we also have s{G) + s(e') > s*. 
Thus there exists a < 1 such that the fractional set F = G U {ae'} has size exactly equal to s*. 

Next we split the profit of F* into two parts, and bound the first by hp-{H) and the second by gp-{F): 

7r(F*) = hp-{F*)+gp-{s*) < hp-{H) + gp-{F). 
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Note that we can drop elements which have a negative contribution to the sum to get 



hp-{F*) < hp-{F* n Pp-) 

< V(supp(F*)nPp-) 

< hp-{H) 

< tt{H). 

The second inequality follows since we can only increase the value of a subset by "rounding up" fractional 
elements. The third inequality follows from the optimality of H, and the fourth from the fact that it is 
p -bounded. 

To bound the second part we note that s{F) = s*, hence gp-{F) = p^s* — C(s*). Elements in F have 
density no less than and its size is bounded above by s(p"), hence it is a p" -bounded set implying that 
7r(F) > gp-{F). Note that F = GVJ {ae'}, and by sub-additivity of 7r() we have 7r(G) + 7r(ae') > 7r(F). 
Moreover e' G I implies 7r(ae') < vr(e') and hence we get 

T^{F*) < 7r{H) + g^{F) < tt{H) + 7r(G) + 7r(e'), 

which proves the second half of the lemma. 

The first half of the lemma follows along similar lines. We have the standard decomposition, it{0* {P-y)) = 
/i^(0*(-By))+(7-y(0*(-Py)). By definition, is the constrained maximizer of /i^, hence we get /i^(0*(P^)) < 
hy{Hy). We note that all feasible sets in Py are 7-bounded, for density 7 > p" (Lemma [4.41 ). Hence, by 
Lemma l4~n strictly increases with size when restricted to feasible sets in Py. is the largest such set, 
hence we get gy{Gy) > gy{0* {Py)). and G^ are 7-bounded and hence by Proposition 14.21 we have 
T^{Hy) > hy{Hy) and 7r(G^) > g^{G.y). This establishes the lemma. □ 

This lemma immediately gives us an offline approximation algorithm for (vr, J^): for every element 
density 7, we find the sets and G^ ; we then output the best (in terms of profit) of these sets or the best 
individual element. We obtain the following theorem: 

Theorem 4.6. Algorithm^A-approximates {tt,F) in the offline setting. 



Algorithm 3 Offline algorithm for single-dimensional (vr, F) 
1: Set^max ^ argmax^g|j:^^}^ 7r(if) 
2: Set^max ^ argmaxG6{G^}^ vr(G) 
3: Set Cmax ^ argmax^gj; 7r(e) 
4: Assign A{U) ^ argmax5g|^_^^^^ ^^^^^^ I tt[S) 



The online setting. Our online algorithm, as in the unconstrained case, uses a sample X from U to obtain 
an estimate r for the density p". Then with equal probability it applies the online algorithm for {hr,F) on 
the remaining set Y r\Pr or the online algorithm for (s, F) (in order to maximize gr) on y n Pr- Lemma l43] 
indicates that it should suffice for r to be larger than p" while ensuring that 'K{0*{Pr)) is large enough. As in 
Section |3] we define the density p+ as the upper limit on r, and claim that r satisfies the required properties 
with high probability. 
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Algorithm 4 Online algorithm for single-dimensional (vr, F) 

1: Draw from Binomial(n, 1/2). 

2: Select the first k elements to be in the sample X. Unconditionally reject these elements. 

3: Toss a fair coin. 

4: if Heads then 

5: Set output O as the first element, over the remaining stream, with profit higher than maxegx vr(e). 

6: else 

7: Determine A{X) using the offline Algorithm[3] 

8: Let /3 be a specified constant and let r be the highest density such that 'k{A{P^)) > ^■k{A{X)). 

9: Toss a fair coin. 

10: if Heads then 

1 1 : Let Oi be the result of executing an online algorithm for {hr^T) on the subset of the remaining 
stream with the objective function 

hM) = Y.^p{e)-r)s{e) 

12: Set O ^ 0. 

13: for e G Oi do 

14: if 7r(OU {e}) - 7r(0) > Othen 

15: SetO^OU{e}. 

16: end if 

17: end for 

18: Output O. 

19: else 

20: Let O2 be the result of executing an online algorithm for F on the subset Pj of the remaining 

stream with objective function s{). 

21: Set 0^0. 

22: for e G O2 do 

23: if 7r(0U {e}) - 7r(0) > Othen 

24: Set O ^ O U {e}. 

25: end if 

26: end for 

27: Output O. 

28: end if 

29: end if 
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Note that we use an algorithm for where $ is a sum-of-values objective in Algorithm |4] as a 

black box. For example if the underlying feasibility constraint is a partition matroid we execute the partition- 
matroid secretary algorithm in steps[TT]and[20]as subroutines. Since these subroutines are online algorithms, 
we can execute steps [TT] and |20] in parallel with the respective 'for' loops in steps [T3] and l22l This ensures 
that all accepted elements have positive profit increment. 

Definition 4.3. For a fixed parameter (3 < 1, let be the highest density with 7r{0*{Pp+)) > (/3/16)^7r(0*). 

Lemma 4.7. For fixed parameters ki>l,k2<XP^^ cind f3' < 1 suppose that there is no element with 
profit more than -^7r{0*). Then with probability at least k2, we have that if t is the highest density such 

that ir{A{P^)) > ^tt{A{X)), then t satisfies p+ > t > p- and n{0*{P^)) > /3' {/3 /16fTT{0*). 
Proof. We first define two events analogous to the events Ei and E2 in Section [3] 

E^:7r{0*{P^-))>(37r{0*{P,-)) 
i?2:7r(0*(Pj+))>/3V(0*(P,+ )) 

We claim that the event Ei immediately implies p"*" > r > p^. Furthermore, when p"*" > r > p^, we 
get the containment Pj+ C P^ C Pj-, which implies tt{0*{P^)) > ■k{0*{PJ+)). This inequality, when 
combined with event E2 and the definition of p"*", proves the second condition. We furthermore claim that 
El and E2 simultaneously hold with probability at least k2, which would give the desired result. 

Thus, we are done if we demonstrate that event Ei implies p^ >t > p , and that the probability of Ei 
and E2 occurring simultaneously is at least /c2; we now proceed to prove each of these claims in turn. 

Claim 1. Event Ei implies that p^ > t and so Pj^ C P^. 
Proof. First, by containment and optimality, we observe that 

7T{0*{Pr))>n{0*iP,^))>n{A{P,^)). 
By definition of r, we have 7r(^(P^)) > ^tt{A{X)). Furthermore, 

n{A{X)) > \7r{0*{X)) > \n{0*{P^-)). 

Lemma 1431 gives us 7r(0*(Pp-)) > ^tt{0*). This together with event Ei implies 

7riO*iP^-))>MO*iPp-))>^7riOn. 

Thus, we have that 7r(0*(i-V)) > ^(O*). We have defined p'^ to be the largest density for which the 

previous profit inequality holds. Hence we conclude that p^ > t. □ 

Claim 2. Event Ei implies that r > p^. 

Proof Asstatedbefore,Lemma|43]givesus7r(0*(Pp-)) > i7r(0*). We have that 7r(^(P;^)) > l7r{0*{P^)). 
Now, El implies that 

vr(0*(P,^))>MO*(^p-))>^vr(0*). 
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Combining these we get that 

vr(^(P,^))>^^(0*)>^vr(^(X)). 

Since r is the largest density for which the above inequality holds we have r > p". □ 

Claim 3. For a fixed constant ki, if no element of S has profit more than ^7r(0*) then Pr[Ei A E2] > 0.52. 

Proof. The proof of this claim is similar to that of Lemma [331 We show that Pr[£'i] > 0.76 and Pr[£'2] > 
0.76; the result then follows by applying the union bound. We begin by observing that -k{0*{P^)) > 
ir{0*{Pp-) n X), and so it suffices to bound the probability that 7r(0*(Pp-) n X) > /37r(0*(Pp-)) and 
likewise the probability that 7r(0*(P^+) DY) > /3'7r(0*(P„+)). 

We fix an ordering over elements of 0*{Pp-), such that the profit increments are non-increasing. That 
is, if Li is the set containing elements 1 thi^ough i and hence the profit increment of the ith element is 
7f(z) := 7r(Lj) — 7r(Lj^i), then we have 7r(l) > tt{2) > . . .. Note that such an ordering can be determined 
by greedily picking elements from 0*{Pp-) such that the profit increment at each step is maximized. 

We set ki > 24, now the fact that no element in S has profit more than ^'/r(0*) implies no element by 
itself has profit more than 1/8 7r(0*(Pp-)), since -K{0*{Pp-)) > l/37r(0*). Profit increments of elements 
are upper bounded by their profits, therefore we can apply Lemma [2?T] with 0*{Pp-) as the fixed set and 
profit increments as the weights. By optimahty of 0*{Pp-) we have that these profit increments are non- 
negative hence the required conditions of Lemma IZT] hold and we have 7r{0*{Pp-) n X) > /?7r(0*(-Pp-)) 
with probability at least 0.76. Since 7r(0*(P^)) > TT{0*{Pp-) n X) wegetPr[£;i] > 0.76 with ^ = 1/10. 

Bydefinitionof/j+ the profit of 0*(Pp+) is at least 7r(0*). With fci > 8 (|) and /3' = 1/10 we 
can again apply Lemma IZT] to show that Pr[£^2] > 0.76. Hence we can set ^2 = 0.52 and this completes 
the proof of the claim. □ 

With the demonstration that the above three claims hold, our proof is complete. □ 

To conclude the analysis, we show that if the online algorithms for {hr,F) and {s,F) have a competitive 
ratio of a, then we obtain an 0{a) approximation to 'n:{0*{Pj)). We therefore get the following theorem. 

Theorem 4.8. If there exists an a-competitive algorithm for the matroid secretary problem -F) where ^ 
is a sum-of-values objective, then Algorithm^achieves a competitive ratio of 0{a) for the problem (vr, T). 

Before we proceed to prove Theorem 14. 8[ we show that in steps |9] to |27] the algorithm obtains a good 
approximation to 0*{Pj). 

Lemma 4.9. Suppose that there is an a-competitive algorithm for where ^ is any sum-of-values ob- 

jective. For a fixed set Y and threshold r, satisfying r > p", we have Eo-[7r(Oi) + vr(02)] > a t^{0*{PJ)), 
where the expectation is over all permutations a ofY. 

Proof. The threshold r is either equal to or strictly greater than p . In the former case Cp- must have been 
in the sample set X and hence Oi, O2 ^ Pp - By Lemma l4!4l we show that Oi and O2 ai^e p -bounded and 
hence r-bounded. On the other hand if r > we can again apply Lemma l4!4l and get that Oi and O2 are 
r-bounded. 

Hence by Proposition|4i2]we get the inequalities E^[7r(Oi)] > [/i^(Oi)] and E^[7r(02)] > ^a[gTi02)]. 
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By applying the a-competitive matroid secretaiy algorithm with objective (Step 11 of Algorithm |4]) 
we get 

E.[/i,(Oi)] > a X hr{H^) 

>ax hr{0*{P^)), 

where the second inequality follows from the optimality of . 

Next we bound ¥jfj[gT-{02)]- Let K be the largest feasible subset contained in Pj . The fact that the 
underlying algorithm is a-competitive implies E(j[s(02)] > a x s{K). 

Note that, as observed above, Pj C Pp-. Since K C Pj , by definition of we get that s{K) < s*. 
So, for O2 we have 



E,[<),(02)] > E,[ts(02) - C(s(02)l 



s{K) 

S{02) 

s{K) 
> a{Ts{K) - 
= ol9t{K) 
>agr{0*{P^)). 



si02) 
s{K) 

{ts{K) - C{s{K))) 
C{s{K))) 



C{s{K)) 



Since s{0*{P^)) < s{K) < s* < s(r) we get the last inequality by applying Lemma |4~T] 

The conclusion of the lemma now follows from the decomposition Tr{0*{P^)) = hr{0*{Pj)) + 

9r{0*{Pj)). □ 

Proof of Theorem With probability ^ we apply the standard secretary algorithm which is e-competitive. 
If an element has profit more than ^7r(0*), in expectation we get a profit of 2^ times the optimal. 

We have Pr[i?i A E2] > k2, for a fixed constant k2. Also, the events Ei and E2 depend only on 
what elements are in X and Y, and not on their ordering in the stream. So conditioned on Ei and E2, the 
remaining stream is still a uniformly random permutation of Y . Therefore, if no element has profit more 
than ^7r(0*) we can apply the second inequality of Lemma 14. 7 1 and Lemma [49l along with the fact that we 
output Oi and O2 with probability ^ each, to show that 



E[7r(0)] > -E[7r(Oi)+7r(02) 



El A E2] X Vt[Ei a E2] 



> -^E[7riO*{P^))\EiAE2] 



> 



ak2/3' f 13 



16 1 '<°*>' 



Overall, we have that 



E[7r(0)] > min 



2kie 



tt{0* 



Since all the involved parameters are fixed constants we get the desired result. 



□ 
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5 Multi-dimensional Profit Maximization 



In this section, we consider the GSP with a multi-dimensional profit objective. Recall that in this setting 
each element e has £ different sizes si (e) , . . . , ^^(e), and the cost of a subset is defined by i different convex 
functions Ci, . . . , C^. The profit function is defined as vr(yl) = v{A) — Q(sj(A)). 

As in the single-dimensional setting, we partition U into two sets I and F with F = {e G C/ : 
argmax^, 7r(ae) < 1}. We first claim that, as before, an optimal solution cannot contain too many ele- 
ments of F. 

Lemma 5.1. We have that |0* n F | < £ 

Proof. Suppose, towards a contradiction, that \0* n F | > £ + 1. For i G {1, . . . , £}, let rrii be any element 
in O* with Si(mj) = maxego* Si{e). Since |0* n F | > £ + 1, there exists o G (O* n F) \ {mi, . . . , me} 
such that Si{o) < Si{0* \ {o}) for all i. This implies that when we compare the marginal cost of adding 
another copy of o to {o} against the marginal cost of adding o to O* \ {a}, we have by convexity 

^ Q(s,(o + a)) - Q(s,(o)) < Q{si{0*)) - Q(s,(0* \ {a})). 

i i 

Therefore, we have vr(0*) — 7r(0* \ {o}) < 7r(o + a) — it{o) < since o G F, and this contradicts the 
optimahty of O*. □ 

We therefore focus on approximating vr over I and devote this section to the unconstrained problem (vr, 2^). 

The main challenge of this setting is that we cannot summarize the value-size tradeoff that an element 
provides by a single density because the element can be quite large in one dimension and very small in 
another. Our high level approach is to distribute the value of each element across the £ dimensions, thereby 
defining densities and decomposing profit across dimensions appropriately. We do this in such a way that 
a maximizer of the ith dimensional profit for some dimension i gives us a good overall solution (albeit at a 
cost of a factor of £). 

Formally, let p : U ^ M.^ denote an ^-dimensional vector function p{e) = {pi{e), . . . , pe{e)) that 
satisfies Pi(e)sj(e) = v{e) for all e. We set Vi{e) = pi{e)si{e) and ■iri{A) = Vi{A) — Ci{si{A)) and 
note that -k{A) = ^ • TTi{A). Let F* denote the maximizer of vrj over I. Then, 7r(F*) < J2- 7ri{F*). 

Given this observation, it is natural to try to obtain an approximation to vr by solving for for all i and 
rounding the best one. This does not immediately work: even if TTi{F*) is very lai^ge, 'k{F*) could be neg- 
ative because of the profit of the set being negative in other dimensions. We will now describe an approach 
for defining and finding density vectors such that the best set F* indeed gives an 0{£) approximation to 
0*{1). We first define a quantity 11, (7^) which bounds the ith dimensional profit that can be obtained by any 
set with elements of ith density at most 7i: = m.a'x.t{'yit — Ci{t)). We can bound ■Ki{F^) using nj(-). 

Lemma 5.2. For a given density jj, let A = {a ^ F* : Pj{a) > jj}. Then T^j(F*) < T^j{A) + 

Proof. Subadditivity implies 7rj(F*) < TJ:j{A)+'Kj{F*\A). Since the elements e in F*\ A have /9j(e) < 7j, 
we have vrj(F* \ ^) < maxj 7jt - Cj(t) = nj(7j). □ 

In order to obtain a uniform bound on the profits ■Kj{F* ), we restrict density vectors as follows. We call 
a vector p{e) proper if it satisfies the following properties: 

(PI) E^P^ie)si{e) = v{e) 

(P2) Ili{pi{e)) = Ilj{pj{e)) for all i,j G [1,^]; we denote this quantity by n(e). 
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The following lemma is proved in Section [5?T] 

Lemma 5,3. For every element e, a unique proper density vector exists and can be found in polynomial 
time. 

Finally, we note that proper density vectors induce a single ordering over elements. In particular, since 
the HjS are monotone, Pi{e) > Pi{e') if and only if n(e) > n(e'). We order the elements ei, . . . , e„ in 
decreasing order of IT. Note that each F* is a (fractional) prefix of this sequence. Let F^ be the shortest 
prefix. Let A = {ei, . . . , 6^^} denote the integral part of F^ and e' = efci+i(/.e, F^'s unique fractional 
element if it exists). 

First, we need the following fact about F^. It implies that the multidimensional profit function vr is 
monotone when restricted to subsets of Ff. 

Lemma 5.4. Consider subsets Ai,A2 C Fj*. If Ai D A2, then ^{Ai) > tt{A2). Furthermore, vr(yl) > 
TTi{A) for all i if A C F*. 

Proof For any i, we have F^ C F^. Since F^ is the fractional prefix that optimizes vrj(-), applying 
Lemma [l!2l to 7rj(-) implies that 7^i{Ai) > 7ri{A2) > 0. By summing, we have 7r(Ai) > ir{A2), and also 
that tt{Ai) > 7ri{Ai). Setting Ai as A gives us the second claim. □ 

We get the following lemma by noting that 7ri(Fj*) > ni(e'). 

Lemma 5.5. For proper p{e)s and A and e' as defined above, for every i, TTi{F*) < 'iTi{A) + tti{F^) < 
2tt{F^). Furthermore, 7r{F*) < e{7r{A) + n{e')). 

Proof. We begin by proving that ni{Ff) > ni(e'). Let p'^ = mina^F* Pi{cL). We observe that pi{e') < 
Ci{si{F*)) since otherwise, we could have included an additional fractional amount of e' to F* and increased 
its profit. Together with Claim 1431 we have Si(e') < Si{F*) < Sj(p^). Thus, applying Claim |4~T] and the 
fact that p'- > pj(e'), we have 

n,(e') = p.(e')si(e') - Q(s,(e')) < p^e') - Q{s,{e')) < p'MF*) - Q{s,{Fn). 

Since F* is p^-bounded in the ith dimension, we have 'n-i{F*) > p'-Si{F*) — Ci{si{F*)) by Proposition 14.21 
This proves that 7Ti{F^) > Ui{e'). 

Since p is proper, we have nj(e') = ni(e'). Recall that e' is not in A, the integral subset of Ff, so we 
have nj(e') = ni(e') < 7ri(Fi*). Together with Lemma |5^ this gives us 

vr.(i^*) <7ri(^)+7ri(Fi*). (1) 

Now, Lemma El gives us 7r{F^) > 7Ti{F^) > 7ri{A) and tt{F^) > iriiF^), so we have 7ri(Ff) < 2tt{F^) 
as claimed. 

Summing Equation ([T]) over i and applying subadditivity to 7ri(Fj*), we have 

^ TT^iF*) < [n,iA) + 7ri(F*)] + ^F*) 

< J^7ri(^)+£(7ri(^)+7ri(ae')) 
= Tr{A) + l)7ri(^) + iTTiiae'). 
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Applying Lemma [541 we have vr(^) > 7ri(^) and 7r(ae') > 7ri(ae'), and so 

tt{A) + {£- l)7Ti{A) + £TTi{ae') < £{tt{A) + 7r(ae')). 
The conclusion now follows from the fact we ai^e considering only elements from I and ir{F*) = ^ • iri{F*) < 

Lemmas |5 . 1 1 and [531 together give tt{0*) < £{tt{A) + 2maxe7r(e)), and therefore imply an offline 
3£-approximation for (vr, 2^) in the multi-dimensional setting. 

The online setting. Note that proper densities essentially define a 1 -dimensional manifold in ^-dimensional 
space. We can therefore hope to apply our online algorithm from Section [3] to this setting. However, there 
is a caveat: the algorithm from Section |3] uses the offline algorithm as a subroutine on the sample X to 
estimate the threshold r; naively replacing the subroutine by the 0{i) approximation described above leads 
to an 0{£'^) competitive online algorithnlj. In order to improve the competitive ratio to 0{£) we need to 
pick the threshold r more carefully. 

Algorithm 5 Online algorithm for multi-dimensional (vr, 2^) 
1: With probability 1/2 run the classic secretary algorithm to pick the single most profitable element else 

execute the following steps. 
2: Draw k from Binomial(n, 1/2). 

3: Select the first k elements to be in the sample X. Unconditionally reject these elements. 

4: Let r be largest density such that Cr € X satisfies ■k{P^) + vr(eT-) > §7rj(F*(X)) for all i, for a 

constant /?. 
5: Initialize selected set 0-^0. 
6: for i G y = [/ \ X do 

7: if 7r(0 U {i}) - 7r(0) > and p{i) > r and i ^ F then 

8: O^OU{i} 

9: else 
10: Exit loop. 
11: end if 
12: end for 



We define r to be the largest density with tr ^ X such that for an appropriate constant /3, tt{P^) + 
Tr{er) > ^iTi{F*{X)) for all i. For a set T, let F*{T) denote the maximizer of tTj over T n I and let 
P*(r) = niF*{T) denote the shortest of these prefixes. Recall that P*{U) = . Let p- denote the 
smallest density in F*. That is, F* = P*{Pp-). Our analysis relies on the following two events: 

El : ^(P*(P^^)) > /37r(P*(Pp-)), K : 7t{0*{X)) > (3"tt{0*). 
El implies the following sequence of inequalities; here the second inequality follows from Lemma [531 

7r{P*{P^-)) > P^{F*i) > ^n,{Fn > ^vr.(i^*(X)). (2) 

This implies r > p . Formally, we have the following claim. 

''Note the (1 — factor in the final competitive ratio in Theorem l3.6l this factor is due to the use of the offline subroutine 

in determining r. 
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Lemma 5.6. Conditioned on Ei, we have t > p . 

Proof. Since C Pp- = A, Lemma [5H implies that P^ C F*{P^). Event Ei guarantees that P^ is 
non-empty, so let p' be the minimum density of P^ . We have P^ = P^, which implies P^ C P^ C 
P*{P^ ) and P^ = P^ = P^Ll{ep>}hy definition. So, we can write F*{P^ ) = F*{P^) = P^Ll{a'ep>} 
for some a' G [0, 1]. Using subadditivity and the fact that e^/ G I gives us 

7r{P^) + 7r(e,0 > 7t{P^ U {a'cp,}) = 7r(P*(P^^)). (3) 

Now, threshold r as selected by Algorithm[5]is the largest density such that 7r(P^)+7r(eT-) > f 7r(F^* (X)) 
for all i. Since step 4 of the algorithm would have considered 7r(P^) + 7r(ep/), Equations ^ and Q imply 
r > p'. By definition, p' > p , so we conclude that Ei implies t > p^. □ 

Furthermore, by definition, e,- S X and so Cr ^ P^ , which impUes that P^ C Pr- Since t > p^ 
implies P^ C Pp- = A, Lemma [s!6l gives us the following lemma. 

Lemma 5.7. Conditioned on Ei, we have O = (Z P^ d A. 

Summing over all dimensions and applying event Pj gives us 

£WP,^) + 7r(e.)) >^Y.^^{F:{X)) > ^ j;vr.(0*(X)) = ^7r(0*(X)) > ^7r(0*). 

i j 

So if we define p+ to be the highest density such that 7r(Pp+) + 7r(ep+) > ^-7r(0*), then we get p"*" > r. 
Then, as before we can define the event E2 in terms of p+ to conclude that 'k{PJ) is large enough. Formally, 
we define E2 for some fixed constant j3' as follows. 

E2:^{Pj^)>l3'^{Pp^). 

Since Pj:; C P^ (Z Pr C A, applying Claim El we have tt{0) = 7r(P^^) > ■k{PJ+). Therefore, 
conditioning on events Pi, Pj and P2, we get 

vr(0)>^(Pj+)>^vr(0*). 

To wrap up the analysis, we argue that the probability of these events is bounded from below by a 
constant. Using Lemma IZT] as in the proof of Lemma [33], if no element has profit at least ^7r(0*), 
then events Pi, P^ and P2 each occur with probability at least 0.76. Using a union bound, we have that 
Pr[Pi ^E[^ P2] > 0.28. This proves the following lemma. 

Lemma 5.8. Suppose that maxg 7r(e) < (1/A;3^)7r(0*). Then, Pr[Pi A P[ A P2] > 0.28. 

Via a similar argument as for Theorem l3.6[ we get 
Theorem 5.9. Algorithm\5\is 0{i) competitive for (vr, 2^) where vr is a multi-dimensional profit function. 
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5.1 Computing Proper Densities 

In this subsection, we show that proper p's exist and can be efficiently computed. To do this, we make the 
following assumptions about the marginal cost functions: 

(Al) The marginal cost functions Cj{-) are unbounded. 

(A2) They satisfy Cj{sj{U)) > maxggc/ v{e) for all i. 

Note that these assumptions do not affect Cj(t) for sizes t < Si{U), and so have no effect on either the 
profit function or the optimal solution as well. We first prove that there exists a proper p{e) for each element 
e. Observe that properties (PI) and (P2) together uniquely define p{e). Therefore, we only need to find x* 
satisfying the equation 

Y^UjHx*)s,ie) = v{ey, (4) 
j 

then our proper density p{e) is given by Pj{e) = IlJ^{x*). By Assumption (Al), Ilj is a strictly-increasing 
continuous function with 11^(0) = and lim^_j.oo 11^ (7) = 00, so its inverse IlJ^ is well-defined and is also 
a strictly-increasing continuous function with 11^^(0) = and limx_j.oo IlJ^{x) = 00. Thus, the solution 
X* exists. 

Next, we show that for any element e, we can efficiently compute p{e). In the following, we fix an 
element e, and use Sj and v to denote Sj{e) and v{e), respectively. We define Ij{t) = Ilj{cj{t)). Let x* be 
the solution to Equation |4] for the fixed element e and note that x* = n(e). 

In the following two lemma statements and proofs, we focus on a single dimension j and remove sub- 
scripts for ease of notation. First, we prove that we can easily compute I{t). 

Lemma 5.10. We have I(t) = c{t)t - C{t). 

Proof. Let t' = s(c(t)). Since t' is the maximum size r > t such that c(t) > c(r), by monotonicity, we 
have that c(-) is constant in [t, t']. This implies C(t') — C{t) = c{t){t' — t) and so, 

c{t)t' - C(t') = c{t)t - C{t). 

The lemma now follows from the fact that I{t) = n(c(t)) is the LHS of the above equation. □ 

Next, we prove a lemma that helps us determine 11^^ (x) given x. 

Lemma 5.11. Given x and positive integer t such that I{t) < x < I{t + 1), we have 

U-\x) = ^-±^. 
^ ' t 

Proof. By definition of /(•), we have n(c(t)) < x < n(c(t + 1)), and since n(-) is strictly increasing, we 
get 

c(t) < n~^(x) < c(t + i). 

By definition of s(-), this gives us t < s(n~^(x)) < t + 1, and therefore s(n~^(j;)) = t, since c(-) changes 
only on the integer points. 
Thus, we have 

X = U{U'\x)) = U~^{x)t - C{t) 
and solving this for 11"^ (x) gives us the desired equality. □ 
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Lemma 15.111 leads to the FiND-DENSlTY algorithm which, given a profit x, uses a binary search to 
compute nj^(x). Together with Lemma 15.121 this enables us to determine x* by first using binary search, 
and then solving linear equations. 

Algorithm 6 Given x and sizes Sj, find n~^(x) and ti, . . . ,t£ satisfying Ij{tj) < x < Ij{tj + 1). 
Find-Density (x, s) 
I: for j = 1 to £do 

2: Binary seaixh to find integral tj G [0, Sj{U)) satisfying Ij{tj) < x < Ij{tj + 1). 
3: Set pj ^ (x + Cj{tj))/tj. 

4: end for 

5: return (p, t), where p is the vector (pi, . . . , p^) and t is the vector [ti, . . . , t^). 



Lemma 5.12. Suppose we have a positive integer x such that 

Y,^jHx)sj < V < Y,^j\x + l)s„ (5) 

j j 

and positive integers ti, . . . ,t£ such that Ij{tj) < x < Ij{tj + 1) for all j. Then the solution x* to 
Equation dUl is precisely: 

-Ej^jitj)isj/tj) 



X 



Proof. Equation ^ and the monotonicity of the 11^ ^'s imply that x < x* < x + 1 and so, we have 
Ij{tj) < X* < X + 1 < Ij{tj + 1) for all j. Applying Lemma lB.l ll to dimension j gives us IlJ^{x*) = 
{x* — Cj{tj))/tj. Then, applying Equation dH), we get 



Si. 



v = j2ujHxns, = j2 

Solving this for x* gives the claimed equality. □ 
We need the following lemma to show that the binary seaixh upper bounds in both algorithms are coiTcct. 
Lemma 5.13. For proper density p, we have x* < Ij{sj(U)) for all dimensions j. 

Proof. If an element has zero size in dimension j, then we can ignore Cj (-). So without loss of generality, 
we assume that Sj > for all j. Property (PI) gives us u = piSi > pjsj and so pj < v/sj < v. From 
assumption (A2), we have that Cj{sj{U)) > v > pj. This implies that 

U,ip,)<Uic,is,{U))) = I,isjiU)) 

Since p is proper, we have Ilj{pj) = x* and this proves the lemma. □ 

Theorem 5.14. Algorithms FiND-DENSlTY and Find-Proper-Density run in polynomial time and are 
correct. 
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Algorithm 7 Given sizes sj and value v, find p satisfying (PI) and (P2). 
Find-Proper-Density (f, s) 
I: Binary search to find integral x G [0, minj Ij{sj{U))) satisfying 

where {6~,t) = Find-Density (x, s) and {S~^,t') = Find-Density (x + l,s). 
2: Set 

* '^-EjCj(tj)(gj/^j) 

3: for j = 1 to £ do 

4: Set pj ^ {x* +Cj{tj))/tj. 

5: end for 
6: return p 



Proof. Lemmas 15.111 and 15.121 show that given correctness of the binary search upper bounds, the output 
is correct. Lemma 15 . 1 3 1 implies that the binary search upper bound of Find-Proper-Density is correct. 
We observe that Find-Density is only invoked for integral profits x < Ij{sj{U)). Therefore, we have 
Ij (t') < x < Ij{t' + 1) for t' < Sj{U) and this proves that the binary search upper bound of Find-Density 
is correct. 

Since the numbers involved in the arithmetic and the binary search are polynomial in terms of Sj{U), 
Cj (sj ([/)), we conclude that the algorithms take time polynomial in the input size. □ 

The online setting. When the algorithm does not get to see the entire input at once, then it does not 
know Sj{U). However, we can get around this by observing that if we have sizes ti,. . . ,t£ satisfying 
Ijitj) > Ilj{pj) = X* for all j, then Ij{tj) > x* and hence miiij Ij{tj) suffices as an upper bound for the 
binary search in Find-Proper-Density. Since we invoke Find-Density for x < Ij{tj), we have that 
ti, . . . ,t£ suffice as upper bounds for the binary searches in Find-Density as well. 

By (A2), we have Cj{sj{U)) > v > pj for any proper p, so if we guess tj such that Cj{tj) > v, then 
we have Ij{tj) > '\lj{pj). Therefore, we set tj = 2"^, with m = 1 initially, increment m one at a time and 
check if Cj{tj) < v. Assumption (A2) and monotonicity of Cj(-) implies that tj < 2sj{U) and so it takes us 
ITT- < log(2sj(f^)) iterations to get Cj{tj) > v. Repeating this for each dimension gives us sufficient upper 
bounds for the binary searches in both algorithms. 

6 Multi-dimensional costs with general feasibility constraint 

In this section we consider the multi-dimensional costs setting with a general feasibility constraint, (vr, J^). 
As before we use an 0(l)-approximate offline and an a-competitive online algorithm for as a sub- 

routine, where $ is a sum-of- values objective. While we will be able to obtain an 0{a£) approximation in 
the offline setting, this only translates into an 0{ai^) competitive online algorithm. 

As in Section [21 we associate with each element e an ^-dimensional proper density vector p(e). Then 
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we decompose the profit functions vTj into sum-of-values objectives defined as follows. 

g\{A) = jiSi{A) - Ci{si{A)), 

KiA) = Y,iPiie)-7i)si{e). 

e<^A 

Let h^{A) = J2i h\iA) and g^iA) = J2^ 9\{^)- We have tt,{A) = hi^{A) + gi^{A) for all i and tt{A) = 
h^{A) + g-yiA). As before, G^^i = argmax^^p^ Si{A). 

We extend the definition of "boundedness" to the multi-dimensional setting as follows (see also Defini- 
tion gj]). 

Definition 6.1. Given a density vector 7, a subset A C is said to be ^-bounded if, for all i, A is 'ji- 
bounded with respect to Cj, that is, Pi{A) > 7^ and Si{A) < Si(7i). If A is ^-bounded and 7 is the minimum 
density of A, then we say A is bounded. 

The following lemma is analogous to Proposition 14.21 

Lemma 6.1. If A is bounded, then tt{A) > 'Ki{A) for all i. Moreover, if A is ^-bounded then for all i, we 
have 

7r(A) > g,{A) > gl^{A), 
7r(A) > h^{A) > hl^{A). 

Proof. We start by assuming that A is 7-bounded. For each dimension i, we get 7rj(A) > gly{A) > 
by Proposition 14.21 Thus, we have '7t{A) > g-f{A) > gj{A) by summing over i. A similar proof shows 
that IT [A) > hy{A) > hl^{A). Next, we assume that A is bounded. Let be its minimum density. Then 
hl,{A) > and gj^iA) > 0. This gives us TTi{A) = + gi^{A) > and so 7r{A) > TTi{A) for all i. □ 

As in the single-dimensional setting, our approach is to find an appropriate density 7 to bound h^{0*) 
and gy{0*) in terms of the maximizers of hl^ and gl^. We use Algorithm |3j the offline algorithm for the 
single-dimensional constrained setting given in Section |4l as a subroutine. We consider two possible sce- 
narios: either all feasible sets are bounded or there exists an unbounded set. We use the following lemma to 
tackle the first scenario. 

Lemma 6.2. Suppose all feasible sets are bounded. Let Di = argmaxnprrr q gi ttAD) be the result 
of running Algorithm\3\on the single-dimensional instance (vrj, where J- is the underlying feasibility 
constraint. Then, we have ^j7r(Dj) > j7r(0*). 

Proof. Fix a dimension i. Since all feasible sets are bounded, we have vr(L'j) > 'Ki{Di) by Lemma |6?T] 
Furthermore, Theorem [3] implies that ■Ki{Di) > jTTi{0*). Summing over all dimensions, we have 

i i i 

Now we handle the case when there exists an unbounded feasible set. 

Lemma 6.3. Suppose there exists an unbounded feasible set. Let be the highest proper density such that 
Pp- contains an unbounded feasible set Let D[ be the result of running Algorithm\3\on Pp- (i.e. ignoring 
elements of density at most p~) with objective VTj and feasibility constraint T. If e.p- G I, then we have 

4^7r(Z)0 + ^(max7r(Gp-,i) + 7r(ep-)) > ^(O*). 
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Proof. For brevity we use e' to denote Cp-. Let Of = O* n [Pp-) and O2 = \ ^i- subadditivity, 
we have vr(0*) < vr(0|) + vr(02)- We observe that all feasible sets in Pp- are bounded by definition of p . 
Thus, we can apply Lemma ld!2] and get 4 vr(L',-) > vr(OJ). 

Next, we approximate ■k{02)- We have hp-{02) < since O2 has elements of density at most p . So, 
we have 7r(0|) < Y,i a'p-iO^)- We know that for all i, 

n(e') = max(p7t - Q(t)) 

>p7Si(02*)-Q(s.(02*)) 

so we have £n(e') > 7r(02) and it suffices to bound n(e'). 

Let L be the unbounded feasible set in Pp-; note that its minimum density is p . Since L is unbounded, 
there exists j such that Sj{L) > Sj(e'). We know that Gp j maximizes Sj{-) among feasible sets contained 
in Pp- \ {cp-}. Therefore, we have Sj{Gp-j) + Sj{e') > Sj{L) > Sj{e'). 

Now consider two cases. Suppose that for all i, Si{Gp-^i) < Si(e'). Then, Si{Gp-j) < Sj(e'). For brevity, 
we denote Gp-j by G'. For all i, let a-i be such that Si{G') + ajSj(e') = Sj(e') and let a = miiij Oj. Without 
loss of generality, we assume that ai is the minimum. By definition, we have si{G' + ae') = Si(e') and 
Si{G' + ae') < Si(e') for i / 1. This implies that G' + ae' is a p -bounded fractional set and so 

7r(G' + ae') > g^-iG' + ae') 

= pisi{G' + ae') - Ci(si(G' + ae')) 
= pi(e')si(e')-Ci(si(e')) 

= n(e'). 

Since e' G I, we have £(7r(G') + 7r(e')) > m(e') > 7r(0^). 

In the second case, there exists a dimension i such that Si(e') < Si{Gp-^i). Without loss of generality, we 
assume that it is the first dimension. In this case, we define G' to be Gp-^i. Let ^ be the minimum density 
in G' . Note that /^i > p^^l < by definition of the Gp-/s. Since G' is bounded, we have si(e') < si(G") < 
Si(/Ui). Applying Lemma |4~T] over pi gives 

gliG') = piSi(G') - Ci(si(G')) > pisi(e') - Ci(si(e')). 

Since G' is p-bounded, by Lemma [6!2] we have 

7r(G') > gliG') 
> MiSi(e') 

= n(e'). 

This gives us in^G') = tt{0^). Overall, we get £(maxi vr(Gp-,i) + 7r(e')) > 7r(0^). □ 

We are now ready to give an offline algorithm. Let Z)^ j be the result of running Algorithm [3] on P^ with 
objective vr^ and feasibility constraint T. 

Finally, we use the above lemmas to lower bound the performance of Algorithm [H 

Theorem 6.4. Algorithm\8\gives a 7i-approximation to (vr, J^) where vr is an i-dimensional profit function 
and J- is a matroid feasibility constraint. 



-Ci(si(e')) 
-Ci(si(e')) 
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Algorithm 8 Offline algorithm for multi-dimensional (vr, F) 

1: Let Gmax = argmaxGg{G^,i}^,, 7r(G). 
2: Let L>max = argmax£,g:|^^^^|^^ 7r(L>). 
3: Let Cmax = argmax^g^ 7r(e). 

4: return the most profitable of Gmax , -Dmax , and Cmax ■ 



Proof. First, we consider only elements from I. If all feasible sets are bounded, then we get vr(Dinax) > 
^7r(0*) by Lemma [6!2l On the other hand, if there exists an unbounded feasible set, then we have 

^(4^(^max ) + vr(en,ax ) + ^(G^nax )) > 4 ^(A') + + max 7r(Gp-,i)) 

i 

> 7r(0* ni) 

by Lemma [631 For elements from F, Lemma lSTT] shows that 7r(emax ) > vr(0* r\¥)/£. This proves that the 
algorithm achieves a 7^-approximation. □ 

6.1 The online setting 



Algorithm 9 Online algorithm for multi-dimensional (vr, T) 

1: Let c be a uniformly random draw from {1, 2, 3}. 

2: Let i* be a uniformly random draw from {!,...,£}. 

3: if c = 1 then 

4: return Oq: the result of running Dynkin's online algorithm. 

5: else if c = 2 then 

6: return Oi: the result of running Algorithmic on (vTj* , J-"). 

7: else if c = 3 then 

8: Draw k from Binomial(n, 1/2). 

9: Let the sample X be the first A; elements and Y be the remaining elements. 
10: Determine A{X) using the offline Algorithm[8] 

11: Let /3 be a specified constant and let r be the highest density such that 7r{A{P^)) > ■^^'7r{A{X)). 
12: return O2' the result of running Algorithmic on with objective tTj* and feasibility constraint J^. 

13: end if 



In this subsection, we develop the online algorithm for constrained multidimensional profit maximiza- 
tion. First we remark that density prefixes are well-defined in this setting, so we can use Algorithm |4j the 
online algorithm for single-dimensional constrained profit maximization, in steps 6 and 12. In order to 
mimic the offline algorithm, we guess (with equal probability) among one of the following scenarios and 
apply the appropriate subroutine for profit maximization: if there is a single high profit element we apply 
Dynkin's algorithm, if all feasible sets are bounded we apply Algorithmic along a randomly selected dimen- 
sion, else if there exists an unbounded feasible set we first estimate a density threshold by sampling and then 
apply Algorithmic over a random dimension but restricted to elements with density above the threshold. 

Most of this subsection is devoted the analysis of Algorithm |9] when there exists an unbounded feasible 
set. We prove the overall competitive ratio of the algorithm in Theorem l6.7l 

We define p+ as follows. 
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Definition 6.2. For a fixed parameter /3 < 1, let p'^ be the highest density such that 7r(0*(Pp+)) > 



The following lemma is essentially an analogue of Lemma 14.71 

Lemma 6.5. Suppose there exists an unbounded feasible set. For fixed parameters ki > 1, < 1, 
/? < 1 and /3' < 1 assume that there does not exists an element with profit more than ^7r(0*). Then with 
probability at least k^, we have that r, as defined in Algorithm |9] satisfies 

1. p'^ > T > and 

2. n{0*{Pr))>f3'[^)\{0*) 

Proof Sketch: We define events Ei and E2 as in the single-dimensional setting: 

E^:7r{0*{P^-))>(37r{0*{P,-)) 
E2:n{0*{Pj^))>P'niO*{P^^)). 

We observe that the proof of Lemma 1477] primarily depends on the properties of density prefixes, in partic- 
ular that 7r(0*(P;^)) is a sufficiently large fraction of TT{0*{Pp-)), and that Algorithm [3js profit does not 
decrease when considering larger prefixes. Then Theorem [6]4] gives us 7r{0*{Pp-)) > ^7r(0*). 

Thus, a proof similar to that of Lemma 1477] validates Lemma [63] □ 
The following lemma establishes the competitive ratio of Algorithmic] in the presence of an unbounded 
feasible set. 

Lemma 6.6. Suppose that there exists an unbounded feasible set, and let a be the competitive ratio that 
Algorithm^achieves for a single-dimensional problem {Tii^T). Then, for a fixed set Y and threshold r, 
satisfying r > p~, we have Eo-[7r(02)] > ^ tt{0*{P^)), where the expectation is over all permutations a 
ofY. 

Proof. First, we prove that O2 is bounded. By the definition of p", since O2 Q Pj it suffices to show that 
P^ C Pp-, as all feasible sets in Pp- are bounded. The threshold r is either equal to or strictly greater than 
and it suffices to consider the former case. If r = p", then tp- must have been in the sample set X so it 
is not in P^ and so P^ C Pp-. 

Hence, by Lemma [6TT1 for all i we have 7r(02) > 7rj(02). Applying Theorem 14. 81 where the ground set 
is Pj gives us E^[7ri(02) \i* =i]> ^■Ki{0*{Pj)). Therefore, we have 



E.[7r(02)] = ^j;E.[7r(02)|r 

i 



>^Y.-'io-(p-)) 



We now prove the main result of this section. 
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Theorem 6.7. Let a denote the competitive ratio ofAlgorithm^\for the single-dimensional problem (vTj, T). 
Then Algorithm\9\achieves a competitive ratio of 0{al^) for the multi-dimensional problem (vr, F). 

Proof. If all feasible sets are bounded, we can apply Lemma [6^1 Then, 

E[vr(0i)]>i^E[7r,(0i)|i*=i] 

i 

i 

Suppose there exists an element with profit more than ^7r(0*). Since we apply the standard secretary 
algorithm with probability i, and the standard secretary algorithm is e-competitive, in expectation we get a 
profit of 3^ times the optimal. 

Finally, if no element has profit more than ^7r(0*) and there exists an unbounded feasible set, using 
the second inequality of Lemma [631 Lemma [631 and the fact that we output O2 with probability i, we get 

E[7r(02)] > ^E[^(02) I El A E2] Vt[Ei A E2] 
>^^■n'^{0*{P^))\El^E2] 

Overall, we have that 

EHO)l>.n.n{|^(JL)\j_,_L}^„0.). 
Since all the involved parameters are fixed constants we get the desired result. □ 
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